RuLegalNER: a new dataset for Russian legal named entities recognition
Annotation
We address the scarcity of datasets specifically tailored for legal NER in the Russian language and investigate the generalization capabilities of models towards unseen named entities. A rule-based program developed by legal experts at Tag-Consulting Company was employed to automatically annotate legal texts and create the RuLegalNER dataset. Part of the named entities only exists in the development and test splits, and they are unseen in the training set. RuBERT was utilized as the base architecture for experimental evaluation. Two different architectural extensions were explored: RuBERT with CRF and RuBERT with adapters. These architectures were used to train and evaluate NER models on the RuLegalNER dataset. Utilize RuLegalNER to train and evaluate legal NER models, enhancing performance in the legal domain and studying generalization on unseen entities. A published version of RuLegalNER is presented with detailed statistics and demonstration of the usefulness of RuLegalNER by evaluating modern architectures.
Keywords
Постоянный URL
Articles in current issue
- Determination of the action type of hydrate formationinhibitors by their infrared spectra
- Application of Raman spectroscopy to study the inactivation process of bacterial microorganisms
- Numerical study of the effect of methemoglobin concentration in the blood on the absorption of light by human skin.
- Low-temperature cell for IR Fourier spectrometric investigation of hydrocarbon substances
- Peculiarities of growing Ga1–xInxAs solid solutions on GaAs substrates in the field of a temperature gradient through a thin gas zone
- An enhanced AES-GCM based security protocol for securing the IoT communication
- Attacks based on malicious perturbations on image processing systems and defense methods against them
- Brain MRT image super resolution using discrete cosine transform and convolutional neural network
- Text augmentation preserving persona speech style and vocabulary
- Verification of event-driven software systems using the specification language of cooperating automata objects
- Intelligent adaptive testing system
- Neural network-based method for visual recognition of driver’s voice commands using attention mechanism
- Brain tumour segmentation in MRI using fuzzy deformable fusion model with Dolphin-SCA
- Optimization of human tracking systems in virtual reality based on a neural network approach
- Errors in the demodulation algorithm with a generated carrier phase introduted by the low-pass filter
- Modeling of the process of spherical form correction for rotors of electrostatically suspended gyros
- Method of spatial multiplexing in multi-antenna communication systems
- Modeling and simulation of heat exchanger with strong dependence of oil viscosity on temperature
- Approach to the generalized parameters formation of the complex technical systems technical condition using neural network structures
- Numerical simulation of gas dynamics during operation of a wide-range rocket nozzle with a porous insert
- The exact solution of a shock wave reflection problem from a wall shielded by a gas suspension layer
- Adaptive observer for state variables of a time-varying nonlinear system with unknown constant parameters and delayed measurements